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METHOD FOR DISTINGUISHING MLL-PTD-POSITIVE AML FROM OTHER AML 

SUBTYPES 

Background of the Invention 
Field of the Invention 

The present invention is directed to a method for distinguishing MLL-PTD-positive 
AML from other AML subtypes by determining the expression level of selected marker 
genes. 

5 Description of Related Art 

According to Golub et al. (Science, 1999, 286, 531-7), gene expression profiles can be 
used for class prediction and discriminating AML from ALL samples. However, for the 
analysis of acute leukemias the selection of the two different subgroups was performed 
using exclusively morphologic-phenotypical criteria. This was only descriptive and does 
10 not provide deeper insights into the pathogenesis or the underlying biology of the 

leukemia. The approach reproduces only very basic knowledge of cytomorphology and 
intends to differentiate classes. The data is not sufficient to predict prognostically 
relevant cytogenetic aberrations. 

15 Furthermore, the international application WO-A 03/039443 discloses marker genes the 
expression levels of which are characteristic for certain leukemia, e.g. AML subtypes 
and additionally discloses methods for differentiating between the subtype of AML cells 
by determining the expression profile of the disclosed marker genes. However, WO-A 
03/039443 does not provide guidance which set of distinct genes discriminate between 

20 two subtypes and, as such, can be routineously taken in order to distinguish one AML 
subtype from another. 

Summary of the Invention 

Leukemias are classified into four different groups or types: acute myeloid (AML), acute 
25 lymphatic (ALL), chronic myeloid (CML) and chronic lymphatic leukemia (CLL). 
Within these groups, several subcategories can be identified further using a panel of 
standard techniques as described below. These different subcategories in leukemias are 
associated with varying clinical outcome and therefore are the basis for different 
treatment strategies. The importance of highly specific classification may be illustrated 
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in detail further for the AML as a very heterogeneous group of diseases. Effort is aimed 
at identifying biological entities and to distinguish and classify subgroups of AML which 
are associated with a favorable, intermediate or unfavorable prognosis, respectively. In 
1976, the FAB classification was proposed by the French-American-British co-operative 
5 group which was based on cytomorphology and cytochemistry in order to separate AML 
subgroups according to the morphological appearance of blasts in the blood and bone 
marrow. In addition, it was recognized that genetic abnormalities occurring in the 
leukemic blast had a major impact on the morphological picture and even more on the 
prognosis. So far, the karyotype of the leukemic blasts is the most important independent 
10 prognostic factor regarding response to therapy as well as survival. 

Usually, a combination of methods is necessary to obtain the most important information 
in leukemia diagnostics: Analysis of the morphology and cytochemistry of bone marrow 
blasts and peripheral blood cells is necessary to establish the diagnosis. In some cases the 

1 5 addition of immunophenotyping is mandatory to separate very undifferentiated AML 
from acute lymphoblastic leukemia and CLL. Leukemia subtypes investigated can be 
diagnosed by cytomorphology alone, only if an expert reviews the smears. However, a 
genetic analysis based on chromosome analysis, fluorescence in situ hybridization or 
RT-PCR and immunophenotyping is required in order to assign all cases in to the right 

20 category. The aim of these techniques besides diagnosis is mainly to determine the 
prognosis of the leukemia. A major disadvantage of these methods, however, is that 
viable cells are necessary as the cells for genetic analysis have to divide in vitro in order 
to obtain metaphases for the analysis. Another problem is the long time of 72 hours from 
receipt of the material in the laboratory to obtain the result. Furthermore, great 

25 experience in preparation of chromosomes and even more in analyzing the karyotypes is 
required to obtain the correct result in at least 90% of cases. Using these techniques in 
combination, hematological malignancies in a first approach are separated into chronic 
myeloid leukemia (CML), chronic lymphatic (CLL), acute lymphoblastic (ALL), and 
acute myeloid leukemia (AML). Within the latter three disease entities several 

30 prognostically relevant subtypes have been established. As a second approach this 

further sub-classification is based mainly on genetic abnormalities of the leukemic blasts 
and clearly is associated with different prognoses. 

The sub-classification of leukemias becomes increasingly important to guide therapy. 
35 The development of new, specific drugs and treatment approaches requires the 

identification of specific subtypes that may benefit from a distinct therapeutic protocol 
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and, thus, can improve outcome of distinct subsets of leukemia. For example, the new 
therapeutic drug (STI571, Imatinib) inhibits the CML specific chimeric tyrosine kinase 
BCR-ABL generated from the genetic defect observed in CML, the BCR-ABL- 
rearrangement due to the translocation between chromosomes 9 and 22 (t(9;22) (q34; 
5 ql 1)). In patients treated with this new drug, the therapy response is dramatically higher 
as compared to all other drugs that had been used so far. Another example is the subtype 
of acute myeloid leukemia AML M3 and its variant M3v both with karyotype 
t(15;17)(q22; ql 1-12). The introduction of a new drug (all-trans retinoic acid - ATRA) 
has improved the outcome in this subgroup of patient from about 50% to 85 % long-term 

10 survivors. As it is mandatory for these patients suffering from these specific leukemia 
subtypes to be identified as fast as possible so that the best therapy can be applied, 
diagnostics today must accomplish sub-classification with maximal precision. Not only 
for these subtypes but also for several other leukemia subtypes different treatment 
approaches could improve outcome. Therefore, rapid and precise identification of 

1 5 distinct leukemia subtypes is the future goal for diagnostics. 



Thus, the technical problem underlying the present invention was to provide means for 
leukemia diagnostics which overcome at least some of the disadvantages of the prior art 
diagnostic methods, in particular encompassing the time-consuming and unreliable 
20 combination of different methods and which provides a rapid assay to unambiguously 
distinguish one AML subtype from another, e.g. by genetic analysis. 



Brief Description of the Invention 

The problem is solved by the present invention, which provides a method for 
25 distinguishing MLL-PTD-positive AML from other AML subtypes in a sample, the 
method comprising determining the expression level of markers selected from the 
markers identifiable by their Affymetrix Identification Numbers (affy id) as defined in 
Tables 1,2, and/or 3, 

wherein 

30 a lower expression of at least one polynucleotide defined by any of the 

numbers 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 
43, 44, 45, 46, 47, 48, 49, and/or 50 of Table 1 
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is indicative for the presence of PTD (MLL-PTD-positive AML with normal 
karyotype) when PTD is distinguished from AML NK (MLL-PTD-negative 
AML with normal karyotype), 

and/or wherein 

5 a lower expression of at least one polynucleotide defined by any of the 

numbers 1, 2, 3, 4, 5, 6, 7, 8, 9, 11, 12, 14, 15, 16, 18, 19, 20, 21, 22, 23, 26, 
27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 42, 44, 45, 47, 48, 49, 
and/or 50 of Table 2.1, and/or 

a higher expression of at least one polynucleotide defined by any of the 
10 numbers 10, 13, 17, 24, 25, 41, 43, and/or 46, of Table 2.1, 

is indicative for M4eo when M4eo is distinguished from all other subtypes, 

and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 1, 2, 3, 4, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 17, 19, 20, 21, 22, 23, 24, 
15 25, 26, 28, 29, 31, 32, 33, 34, 35, 36, 38, 39, 41, 42, 44, 45, 46, 48, 49, and/or 

50 of Table 2.2, and/or 

a higher expression of 5, 13, 18, 27, 30, 37, 40, 43, and/or 47, of Table 2.2 
is indicative for PTD when PTD is distinguished from all other subtypes, 
and/or wherein 

20 a lower expression of at least one polynucleotide defined by any of the 

numbers 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 35, 36, 37, 38, 39, 40, 41, 42, 
43, 44, 45, 46, 47, 49, and/or 50 of Table 2.3, and/or 

a higher expression of at least one polynucleotide defined by any of the 
25 numbers 34, and/or 48, of Table 2.3 

is indicative for inv3 when inv3 is distinguished from all other subtypes, 

and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 1, 2, 3, 5, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 23, 25, 26, 
30 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 41, 42, 43, 44, 45, 46, 47, 

48, and/or 50 of Table 2.4, and/or 
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a higher expression of at least one polynucleotide defined by any of the 
numbers 4, 6, 7, 8, 22, 24, 40, and/or 49, of Table 2.4 

is indicative for t(15;17) when t(15;17) is distinguished from all other 
subtypes, 

5 and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 

42, 43, 44, 45, 46, 47, 48, 49, and/or 50 of Table 2.5 

10 is indicative for t(8;21) when t(8;21) is distinguished from all other subtypes, 

and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 
24, 25, 26, 27, 28, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 42, 43, 45, 46, 
15 47, 48, 49, and/or 50 of Table 2.6, and/or 

a higher expression of at least one polynucleotide defined by any of the 
numbers 12, 15, 29, 41, and/or 44, of Table 2.6 

is indicative for tMLL when tMLL is distinguished from all other subtypes, 
and/or wherein 

20 a lower expression of at least one polynucleotide defined by any of the 

numbers 1, 2, 4, 5, 7, 10, 12, 13, 16, 17, 19, 23, 25, 30, 31, 32, 33, 34, 37, 41, 

43, 45, 47, 48, and/or 50 of Table 3.1,and/or 

a higher expression a polynucleotide defined by any of the numbers 3, 6, 8, 
9, 11, 14, 15, 18, 20, 21, 22, 24, 26, 27, 28, 29, 35, 36, 38, 39, 40, 42, 44, 46, 
25 and/or 49, of Table 3.1, 

is indicative for M4eo when M4eo is distinguished from PTD, 

and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 5, 6, 9, 12, 23, 28, 38, 41, 44, 45, 46, and/or 47, of Table 3.2, and/or 

30 a higher expression of at least one polynucleotide defined by any of the 

numbers 1,2,3,4, 7, 8, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20,21,22, 24, 25, 
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26, 27, 29, 30, 31, 32, 33, 34, 35, 36, 37, 39, 40, 42, 43, 48, 49, and/or 50 of 
Table 3.2, 

is indicative for M4eo when M4eo is distinguished from inv3, 

a lower expression of at least one polynucleotide defined by any of the 
5 numbers 2, 3, 4, 6, 11, 14, 20, 22, 26, 31, 32, 33, 34, 39, 40, 41, and/or 48, of 

Table 3.3, and/or 

a higher expression of at least one polynucleotide defined by any of the 
numbers 1, 5, 7, 8, 9, 10, 12, 13, 15, 16, 17, 18, 19, 21, 23, 24, 25, 27, 28, 29, 
30, 35, 36, 37, 38, 42, 43, 44, 45, 46, 47, 49, and/or 50 of Table 3.3, 

10 is indicative for M4eo when M4eo is distinguished from t(15;17), 

and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 7, 31, 40, and/or 49, of Table 3.4, and/or 

a higher expression of at least one polynucleotide defined by any of the 
15 numbers 1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 

23, 24, 25, 26, 27, 28, 29, 30, 32, 33, 34, 35, 36, 37, 38, 39, 41, 42, 43, 44, 
45, 46, 47, 48, and/or 50 of Table 3.4 

is indicative for M4eo when M4eo is distinguished from t(8;21), 
and/or wherein 

20 a lower expression of at least one polynucleotide defined by any of the 

numbers 1, 3, 10, 14, 17, 18, 19, 21, 24, 25, 26, 31, 32, 34, 41, 44, and/or 50 
of Table 3.5, and/or 

a higher expression of at least one polynucleotide defined by any of the 
numbers 2, 4, 5, 6, 7, 8, 9, 11, 12, 13, 15, 16, 20, 22, 23, 27, 28, 29, 30, 33, 
25 35, 36, 37, 38, 39, 40, 42, 43, 45, 46, 47, 48, and/or 49, of Table 3.5 

is indicative for M4eo when M4eo is distinguished from tMLL, 

and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 4, 6, 9, 28, 30, 32, 35, 37, 44, 45, and/or 48, of Table 3.6, and/or 

30 a higher expression of at least one polynucleotide defined by any of the 

numbers 1,2,3,5,7, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20,21,22, 23, 
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24, 25, 26, 27, 29, 31, 33, 34, 36, 38, 39, 40, 41, 42, 43, 46, 47, 49, and/or 50 
of Table 3.6 

is indicative for PTD when PTD is distinguished from inv3, 
and/or wherein 

5 a lower expression of at least one polynucleotide defined by any of the 

numbers 1, 2, 3, 4, 6, 7, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 23, 27, 28, 29, 
30, 31, 32, 33, 34, 36, 38, 39, 41, 43, 44, 45, 47, 48, and/or 50 of Table 3.7, 
and/or 

a higher expression of polynucleotide defined by any of the numbers 5, 8, 9, 
10 19, 21, 22, 24, 25, 26, 35, 37, 40, 42, 46, and/or 49, of Table 3.7, 

is for PTD when PTD is distinguished from t(15;17), 

and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 7, 9, 10, 11, 13, 16, 20, 21, 22, 23, 30, 35, 36, 38, 42, 45, and/or 50 
15 of Table 3.8, and/or 

a higher expression of at least one polynucleotide defined by any of the 
numbers 1, 2, 3, 4, 5, 6, 8, 12, 14, 15, 17, 18, 19, 24, 25, 26, 27, 28, 29, 31, 
32, 33, 34, 37, 39, 40, 41, 43, 44, 46, 47, 48, and/or 49, of Table 3.8 

is indicative for PTD when PTD is distinguished from t(8;21), 

20 and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 1, 5, 8, 10, 11, 13, 15, 17, 19, 25, 26, 28, 29, 34, and/or 46, of Table 
3.9, and/or 

a higher expression of at least one polynucleotide defined by any of the 
25 numbers 2, 3, 4, 6, 7, 9, 12, 14, 16, 18, 20, 21, 22, 23, 24, 27, 30, 31, 32, 33, 

35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 47, 48, 49, and/or 50 of Table 3.9 

is indicative for PTD when PTD is distinguished from tMLL, 

and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
30 numbers 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 

23, 24, 25, 26, 28, 29, 32, 33, 36, 38, 39, 40, 43, 44, 45, 46, 47, and/or 49, of 
Table 3.10, and/or 
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a higher expression of at least one polynucleotide defined by any of the 
numbers 22, 27, 30, 31, 34, 35, 37, 41, 42, 48, and/or 50 of Table 3.10, 

is indicative for inv(3) when inv(3) is distinguished from t(15;17), 

and/or wherein 

5 a lower expression of at least one polynucleotide defined by any of the 

numbers 1, 5, 6, 9, 11, 12, 15, 17, 18, 19, 23, 27, 35, 36, 37, 39, 42, 43, 47, 
49, and/or 50 of Table 3.11, and/or 

a higher expression of at least one polynucleotide defined by any of the 
numbers 2, 3, 4, 7, 8, 10, 13, 14, 16, 20, 21, 22, 24, 25, 26, 28, 29, 30, 31, 32, 
10 33, 34, 38, 40, 41, 44, 45, 46, and/or 48, of Table 3.11 

is indicative for inv(3) when inv(3) is distinguished from t(8;21), 

and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 1, 3, 4, 6, 7, 8, 12, 14, 15, 16, 17, 18, 19, 20, 21, 23, 25, 26, 28, 29, 
15 30, 31, 33, 34, 35, 37, 38, 39, 42, 43, 44, 45, 47, 48, and/or 50 of Table 3.12, 

and/or 

a higher expression of at least one polynucleotide defined by any of the 
numbers 2, 5, 9, 10, 11, 13, 22, 24, 27, 32, 36, 40, 41, 46, and/or 49, of Table 
3.12 

20 is indicative for inv(3) when inv(3) is distinguished from tMLL, 

and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 3, 4, 7, 14, 16, 20, 22, 23, 24, 25, 26, 30, 35, 36, 37, 39, 40, 43, 44, 
46, and/or 50 of Table 3.13, and/or 

25 a higher expression of at least one polynucleotide defined by any of the 

numbers 1, 2, 5, 6, 8, 9, 10, 11, 12, 13, 15, 17, 18, 19, 21, 27, 28, 29, 31, 32, 
33, 34, 38, 41, 42, 45, 47, 48, and/or 49 of Table 3.13, 

is indicative for t(15;17) when t(15;17) is distinguished from t(8;21), 

and/or wherein 

30 a lower expression of at least one polynucleotide defined by any of the 

numbers 13, 15, 25, 26, 27, 28, 30, 32, 33, 35, 36, 38, 39, 43, 48, and/or 49, 
of Table 3.14, and/or 
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a higher expression of at least one polynucleotide defined by any of the 
numbers 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19, 20, 21, 22, 23, 
24, 29, 31, 34, 37, 40, 41, 42, 44, 45, 46, 47, and/or 50 of Table 3.14, 

is indicative for t(15;17) when t(15;17) is distinguished from tMLL, 

5 and/or wherein 

a lower expression of at least one polynucleotide defined by any of the 
numbers 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 15, 16, 18, 19, 21, 23, 24, 25, 26, 
27, 28, 29, 30, 32, 33, 34, 35, 36, 38, 39, 40, 41, 42, 43, 44, 47, 48, of Table 
3.15, and/or 

10 a higher expression of at least one polynucleotide defined by any of the 

numbers 12, 14, 17, 20, 22, 31, 37, 45, 46, 49, and/or 50 of Table 3.15, 

is indicative for t(8;21) when t(8;21) is distinguished from tMLL. 



15 



20 



25 



As used herein, the following definitions apply to the above used abbreviations (see also 
example 1): 

tMLL: AML with translocations in the MLL gene (t(l Iq23)/MLL) 

PTD: AML with normal karyotype and Partial Tandem Duplication (PTD) 

within the MLL gene (MLL-PTD) 

AML NK AML with normal karyotype (no Partial Tandem Duplication (PTD) 
within the MLL gene) 

t(8;21) AML with translocation t(8;21) 

t(15;17) AML with translocation t(15;17) 

t(inv3) AML with inversion 3 

M4eo AML with inversion 1 6 (inv( 1 6)) 



As used herein, "all other subtypes" refer to the subtypes of the present invention, i.e. if 
one subtype is distinguished from "all other subtypes", it is distiguished from all other 
subtypes contained in the present invention. 



30 



According to the present invention, a "sample" means any biological material containing 
genetic information in the form of nucleic acids or proteins obtainable or obtained from 
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an individual. The sample includes e.g. tissue samples, cell samples, bone marrow and/or 
body fluids such as blood, saliva, semen. Preferably, the sample is blood or bone 
marrow, more preferably the sample is bone marrow. The person skilled in the art is 
aware of methods, how to isolate nucleic acids and proteins from a sample. A general 
5 method for isolating and preparing nucleic acids from a sample is outlined in Example 3. 

According to the present invention, the term "lower expression" is generally assigned to 
all by numbers and Affymetrix Id. definable polynucleotides the t- values and fold change 
(fc) values of which are negative, as indicated in the Tables. Accordingly, the term 
10 "higher expression" is generally assigned to all by numbers and Affymetrix Id. definable 
polynucleotides the t- values and fold change (fc) values of which are positive. 

According to the present invention, the term "expression" refers to the process by which 
mRNA or a polypeptide is produced based on the nucleic acid sequence of a gene, i.e. 
15 „expression" also includes the formation of mRNA upon transcription. In accordance 

with the present invention, the term determining the expression level" preferably refers 
to the determination of the level of expression, namely of the markers. 

Generally, "marker" refers to any genetically controlled difference which can be used in 
20 the genetic analysis of a test versus a control sample, for the purpose of assigning the 
sample to a defined genotype or phenotype. As used herein, "markers" refer to genes 
which are differentially expressed in, e.g., different AML subtypes. The markers can be 
defined by their gene symbol name, their encoded protein name, their transcript 
identification number (cluster identification number), the data base accession number, 
25 public accession number or GenBank identifier or, as done in the present invention, 

Affymetrix identification number, chromosomal location, UniGene accession number 
and cluster type, LocusLink accession number (see Examples and Tables). 

The Affymetrix identification number (affy id) is accessible for anyone and the person 
30 skilled in the art by entering the "gene expression omnibus" internet page of the National 
Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/geo/). In 
particular, the affy id's of the polynucleotides used for the method of the present 
invention are derived from the so-called U133 chip. The sequence data of each 
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identification number can be viewed at 
http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL96 

Generally, the expression level of a marker is determined by the determining the 
5 expression of its corresponding "polynucleotide" as described hereinafter. 

According to the present invention, the term polynucleotide" refers, generally, to a 
DNA, in particular cDNA, or RNA, in particular a cRNA, or a portion thereof or a 
polypeptide or a portion thereof. In the case of RNA (or cDNA), the polynucleotide is 

10 formed upon transcription of a nucleotide sequence which is capable of expression. The 
polynucleotide fragments refer to fragments preferably of between at least 8, such as 10, 
12, 15 or 18 nucleotides and at least 50, such as 60, 80, 100, 200 or 300 nucleotides in 
length, or a complementary sequence thereto, representing a consecutive stretch of 
nucleotides of a gene, cDNA or mRNA. In other terms, polynucleotides include also any 

1 5 fragment (or complementary sequence thereto) of a sequence derived from any of the 
markers defined above as long as these fragments unambiguously identify the marker. 

The determination of the expression level may be effected at the transcriptional or 
translational level, i.e. at the level of mRNA or at the protein level. Protein fragments 

20 such as peptides or polypeptides advantageously comprise between at least 6 and at least 
25, such as 30, 40, 80, 100 or 200 consecutive amino acids representative of the 
corresponding full length protein. Six amino acids are generally recognized as the lowest 
peptidic stretch giving rise to a linear epitope recognized by an antibody, fragment or 
derivative thereof. Alternatively, the proteins or fragments thereof may be analysed using 

25 nucleic acid molecules specifically binding to three-dimensional structures (aptamers). 

Depending on the nature of the polynucleotide or polypeptide, the determination of the 
expression levels may be effected by a variety of methods. For determining and detecting 
the expression level, it is preferred in the present invention that the polynucleotide, in 
30 particular the cRNA, is labelled. 

The labelling of the polynucleotide or a polypeptide can occur by a variety of methods 
known to the skilled artisan. The label can be fluorescent, chemiluminescent, 
bioluminescent, radioactive (such as 3 H or 32 P). The labelling compound can be any 
35 labelling compound being suitable for the labelling of polynucleotides and/or 
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polypeptides. Examples include fluorescent dyes, such as fluorescein, 
dichlorofluorescein, hexachlorofluorescein, BODIPY variants, ROX, 
tetramethylrhodamin, rhodamin X, Cyanine-2, Cyanine-3, Cyanine-5, Cyanine-7, 
ERD40, FluorX, Oregon Green, Alexa variants (available e.g. from Molecular Probes or 
5 Amersham Biosciences) and the like, biotin or biotinylated nucleotides, digoxigenin, 

radioisotopes, antibodies, enzymes and receptors. Depending on the type of labelling, the 
detection is done via fluorescence measurements, conjugation to streptavidin and/or 
avidin, antigen-antibody- and/or antibody-antibody-interactions, radioactivity 
measurements, as well as catalytic and/or receptor/ligand interactions. Suitable methods 

10 include the direct labelling (incorporation) method, the amino-modified (amino-allyl) 
nucleotide method (available e.g. from Ambion), and the primer tagging method (DNA 
dendrimer labelling, as kit available e.g. from Genisphere). Particularly preferred for the 
present invention is the use of biotin or biotinylated nucleotides for labelling, with the 
latter being directly incorporated into, e.g. the cRNA polynucleotide by in vitro 

15 transcription. 

If the polynucleotide is mRNA, cDNA may be prepared into which a detectable label, as 
exemplified above, is incorporated. Said detectably labelled cDNA, in single-stranded 
form, may then be hybridised, preferably under stringent or highly stringent conditions to 

20 a panel of single-stranded oligonucleotides representing different genes and affixed to a 
solid support such as a chip. Upon applying appropriate washing steps, those cDNAs will 
be detected or quantitatively detected that have a counterpart in the oligonucleotide 
panel. Various advantageous embodiments of this general method are feasible. For 
example, the mRNA or the cDNA may be amplified e.g. by polymerase chain reaction, 

25 wherein it is preferable, for quantitative assessments, that the number of amplified copies 
corresponds relative to further amplified mRNAs or cDNAs to the number of mRNAs 
originally present in the cell. In a preferred embodiment of the present in ivention, the 
cDNAs are transcribed into cRNAs prior to the hybridisation step wherein only in the 
transcription step a label is incorporated into the nucleic acid and wherein the cRNA is 

30 employed for hybridisation. Alternatively, the label may be attached subsequent to the 
transcription step. 

Similarly, proteins from a cell or tissue under investigation may be contacted with a 
panel of aptamers or of antibodies or fragments or derivatives thereof. The antibodies 
35 etc. may be affixed to a solid support such as a chip. Binding of proteins indicative of an 
AML subtype may be verified by binding to a detectably labelled secondary antibody or 
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aptamer. For the labelling of antibodies, it is referred to Harlow and Lane, "Antibodies, a 
laboratory manual", CSH Press, 1988, Cold Spring Harbor. Specifically, a minimum set 
of proteins necessary for diagnosis of all AML subtypes may be selected for creation of a 
protein array system to make diagnosis on a protein lysate of a diagnostic bone marrow 
5 sample directly. Protein Array Systems for the detection of specific protein expression 
profiles already are available (for example: Bio-Plex, BIORAD, Miinchen, Germany). 
For this application preferably antibodies against the proteins have to be produced and 
immobilized on a platform e.g. glasslides or microtiterplates. The immobilized 
antibodies can be labelled with a reactant specific for the certain target proteins as 
10 discussed above. The reactants can include enzyme substrates, DNA, receptors, antigens 
or antibodies to create for example a capture sandwich immunoassay. 

For reliably distinguishing MLL-PTD-positive AML from other AML subtypes in a 
sample it is useful that the expression of more than one of the above defined markers is 

15 determined. As a criterion for the choice of markers, the statistical significance of 

markers as expressed in q or p values based on the concept of the false discovery rate is 
determined. In doing so, a measure of statistical significance called the q value is 
associated with each tested feature. The q value is similar to the p value, except it is a 
measure of significance in terms of the false discovery rate rather than the false positive 

20 rate (Storey JD and Tibshirani R. Proc.Natl.Acad.Sci., 2003, Vol. 100:9440-5. 

In a preferred embodiment of the present invention, markers as defined in Table 1.1-3.15 
having a q-value of less than 3E-03, more preferred less than 1.5E-09, most preferred 
less than 1.5E-11, less than 1.5E-20, less than 1.5E-30, are measured. 

25 

Of the above defined markers, the expression level of at least two, preferably of at least 
ten, more preferably of at least 25, most preferably of 50 of at least one of the Tables of 
the markers is determined. 

30 In another preferred embodiment, the expression level of at least 2, of at least 5, of at 

least 10 out of the markers having the numbers 1 -10, 1-20, 1-40, 1-50 of at least one of 
the Tables are measured. 

The level of the expression of the „marker", i.e. the expression of the polynucleotide is 
35 indicative of the AML subtype of a cell or an organism. The level of expression of a 

marker or group of markers is measured and is compared with the level of expression of 
the same marker or the same group of markers from other cells or samples. The 
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comparison may be effected in an actual experiment or in silico. When the expression 
level also referred to as expression pattern or expression signature (expression profile) is 
measurably different, there is according to the invention a meaningful difference in the 
level of expression. Preferably the difference at least is 5 %, 10% or 20%, more preferred 
5 at least 50% or may even be as high as 75% or 100%. More preferred the difference in 
the level of expression is at least 200%, i.e. two fold, at least 500%, i.e. five fold, or at 
least 1000%, i.e. 10 fold. 

Accordingly, the expression level of markers expressed lower in a first subtype than in at 
10 least one second subtype, which differs from the first subtype, is at least 5 %, 10% or 
20%, more preferred at least 50% or may even be 75% or 100%, i.e. 2-fold lower, 
preferably at least 10-fold, more preferably at least 50-fold, and most preferably at least 
100-fold lower in the first subtype. On the other hand, the expression level of markers 
expressed higher in a first subtype than in at least one second subtype, which differs 
15 from the first subtype, is at least 5 %, 10% or 20%, more preferred at least 50% or may 
even be 75% or 100%, i.e. 2-fold higher, preferably at least 10-fold, more preferably at 
least 50-fold, and most preferably at least 100-fold higher in the first subtype. 

In another embodiment of the present invention, the sample is derived from an individual 
20 having leukaemia, preferably AML. 

For the method of the present invention it is preferred if the polynucleotide the 
expression level of which is determined is in form of a transcribed polynucleotide. A 
particularly preferred transcribed polynucleotide is an mRNA, a cDNA and/or a cRNA, 
25 with the latter being preferred. Transcribed polynucleotides are isolated from a sample, 
reverse transcribed and/or amplified, and labelled, by employing methods well-known 
the person skilled in the art (see Example 3). In a preferred embodiment of the methods 
according to the invention, the step of determining the expression profile further 
comprises amplifying the transcribed polynucleotide. 

30 

In order to determine the expression level of the transcribed polynucleotide by the 
method of the present invention, it is preferred that the method comprises hybridizing the 
transcribed polynucleotide to a complementary polynucleotide, or a portion thereof, 
under stringent hybridization conditions, as described hereinafter. 

35 
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The term "hybridizing" means hybridization under conventional hybridization 
conditions, preferably under stringent conditions as described, for example, in 
Sambrook, J., et al., in "Molecular Cloning: A Laboratory Manual" (1989), Eds. J. 
Sambrook, E. F. Fritsch and T. Maniatis, Cold Spring Harbour Laboratory Press, Cold 
5 Spring Harbour, NY and the further definitions provided above. Such conditions are, for 
example, hybridization in 6x SSC, pH 7.0 / 0.1% SDS at about 45°C for 18-23 hours, 
followed by a washing step with 2x SSC/0.1% SDS at 50°C. In order to select the 
stringency, the salt concentration in the washing step can for example be chosen between 
2x SSC/0.1% SDS at room temperature for low stringency and 0.2x SSC/0.1% SDS at 

10 50°C for high stringency. In addition, the temperature of the washing step can be varied 
between room temperature, ca. 22°C, for low stringency, and 65 °C to 70° C for high 
stringency. Also contemplated are polynucleotides that hybridize at lower stringency 
hybridization conditions. Changes in the stringency of hybridization and signal detection 
are primarily accomplished' through the manipulation, preferably of formamide 

15 concentration (lower percentages of formamide result in lowered stringency), salt 
conditions, or temperature. For example, lower stringency conditions include an 
overnight incubation at 37°C in a solution comprising 6X SSPE (20X SSPE = 3M NaCl; 
0.2M NaH2P04; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 mg/ml 
salmon sperm blocking DNA, followed by washes at 50°C with 1 X SSPE, 0.1% SDS. In 

20 addition, to achieve even lower stringency, washes performed following stringent 

hybridization can be done at higher salt concentrations (e.g. 5x SSC). Variations in the 
above conditions may be accomplished through the inclusion and/or substitution of 
alternate blocking reagents used to suppress background in hybridization experiments. 
The inclusion of specific blocking reagents may require modification of the hybridization 

25 conditions described above, due to problems with compatibility. 

"Complementary" and "complementarity", respectively, can be described by the 
percentage, i.e. proportion, of nucleotides which can form base pairs between two 
polynucleotide strands or within a specific region or domain of the two strands. 

30 Generally, complementary nucleotides are, according to the base pairing rules, adenine 
and thymine (or adenine and uracil), and cytosine and guanine. Complementarity may be 
partial, in which only some of the nucleic acids 1 bases are matched according to the base 
pairing rules. Or, there may be a complete or total complementarity between the nucleic 
acids. The degree of complementarity between nucleic acid strands has effects on the 

35 efficiency and strength of hybridization between nucleic acid strands. 
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Two nucleic acid strands are considered to be 100% complementary to each other over a 
defined length if in a defined region all adenines of a first strand can pair with a thymine 
(or an uracil) of a second strand, all guanines of a first strand can pair with a cytosine of 
a second strand, all thymine (or uracils) of a first strand can pair with an adenine of a 
5 second strand, and all cytosines of a first strand can pair with a guanine of a second 

strand, and vice versa. According to the present invention, the degree of complementarity 
is determined over a stretch of 20, preferably 25, nucleotides, i.e. a 60% 
complementarity means that within a region of 20 nucleotides of two nucleic acid strands 
12 nucleotides of the first strand can base pair with 12 nucleotides of the second strand 

10 according to the above ruling, either as a stretch of 12 contiguous nucleotides or 

interspersed by non-pairing nucleotides, when the two strands are attached to each other 
over said region of 20 nucleotides. The degree of complementarity can range from at 
least about 50% to full, i.e. 100% complementarity. Two single nucleic acid strands are 
said to be "substantially complementary" when they are at least about 80% 

1 5 complementary, preferably about 90% or higher. For carrying out the method of the 
present invention substantial complementarity is preferred. 

Preferred methods for detection and quantification of the amount of polynucleotides, i.e. 
for the methods according to the invention allowing the determination of the level of 

20 expression of a marker, are those described by Sambrook et al. (1989) or real time 

methods known in the art as the TaqMan® method disclosed in WO92/02638 and the 
corresponding U.S. 5,210,015, U.S. 5,804,375, U.S. 5,487,972. This method exploits the 
exonuclease activity of a polymerase to generate a signal. In detail, the (at least one) 
target nucleic acid component is detected by a process comprising contacting the sample 

25 with an oligonucleotide containing a sequence complementary to a region of the target 
nucleic acid component and a labeled oligonucleotide containing a sequence 
complementary to a second region of the same target nucleic acid component sequence 
strand, but not including the nucleic acid sequence defined by the first oligonucleotide, to 
create a mixture of duplexes during hybridization conditions, wherein the duplexes 

30 comprise the target nucleic acid annealed to the first oligonucleotide and to the labeled 
oligonucleotide such that the 3 '-end of the first oligonucleotide is adjacent to the 5 '-end 
of the labeled oligonucleotide. Then this mixture is treated with a template-dependent 
nucleic acid polymerase having a 5' to 3' nuclease activity under conditions sufficient to 
permit the 5' to 3' nuclease activity of the polymerase to cleave the annealed, labeled 

35 oligonucleotide and release labeled fragments. The signal generated by the hydrolysis of 
the labeled oligonucleotide is detected and/ or measured. TaqMan® technology 
eliminates the need for a solid phase bound reaction complex to be formed and made 
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detectable. Other methods include e.g. fluorescence resonance energy transfer between 
two adjacently hybridized probes as used in the LightCycler® format described in U.S. 
6,174,670. 

5 A preferred protocol if the marker, i.e. the polynucleotide, is in form of a transcribed 
nucleotide, is described in Example 3, where total RNA is isolated, cDNA and, 
subsequently, cRNA is synthesized and biotin is incorporated during the transcription 
reaction. The purified cRNA is applied to commercially available arrays which can be 
obtained e.g. from Affymetrix. The hybridized cRNA is detected according to the 
10 methods described in Example 3. The arrays are produced by photolithography or other 
methods known to experts skilled in the art e.g. from U.S. 5,445,934, U.S. 5,744,305, 
U.S. 5,700,637, U.S. 5,945,334 and EP 0 619 321 or EP 0 373 203, or as decribed 
hereinafter in greater detail. 

15 In another embodiment of the present invention, the polynucleotide or at least one of the 
polynucleotides is in form of a polypeptide. In another preferred embodiment, the 
expression level of the polynucleotides or polypeptides is detected using a compound 
which specifically binds to the polynucleotide of the polypeptide of the present 
invention. 

20 

As used herein, "specifically binding" means that the compound is capable of 
discriminating between two or more polynucleotides or polypeptides, i.e. it binds to the 
desired polynucleotide or polypeptide, but essentially does not bind unspecifically to a 
different polynucleotide or polypeptide. 

25 

The compound can be an antibody, or a fragment thereof, an enzyme, a so-called small 
molecule compound, a protein-scaffold, preferably an anticalin. In a preferred 
embodiment, the compound specifically binding to the polynucleotide or polypeptide is 
an antibody, or a fragment thereof. 

30 

As used herein, an "antibody" comprises monoclonal antibodies as first described by 
Kohler and Milstein in Nature 278 (1975), 495-497 as well as polyclonal antibodies, i.e. 
entibodies contained in a polyclonal antiserum. Monoclonal antibodies include those 
produced by transgenic mice. Fragments of antibodies include F(ab')2, Fab and Fv 
35 fragments. Derivatives of antibodies include scFvs, chimeric and humanized antibodies. 
See, for example Harlow and Lane, loc. cit. For the detection of polypeptides using 
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antibodies or fragments thereof, the person skilled in the art is aware of a variety of 
methods, all of which are included in the present invention. Examples include 
immunoprecipitation, Western blotting, Enzyme-linked immuno sorbent assay (ELIS A), 
Enzyme-linked immuno sorbent assay (RIA), dissociation-enhanced lanthanide fluoro 
5 immuno assay (DELFIA), scintillation proximity assay (SPA). For detection, it is 
desirable if the antibody is labelled by one of the labelling compounds and methods 
described supra. 

In another preferred embodiment of the present invention, the method for distinguishing 
10 MLL-PTD-positive AML from other AML subtypes is carried out on an array. 

In general, an "array" or "microarray" refers to a linear or two- or three dimensional 
arrangement of preferably discrete nucleic acid or polypeptide probes which comprises 
an intentionally created collection of nucleic acid or polypeptide probes of any length 

15 spotted onto a substrate/solid support. The person skilled in the art knows a collection of 
nucleic acids or polypeptide spotted onto a substrate/solid support also under the term 
"array". As known to the person skilled in the art, a microarray usually refers to a 
miniaturised array arrangement, with the probes being attached to a density of at least 
about 10, 20, 50, 100 nucleic acid molecules referring to different or the same genes per 

20 cm 2 . Furthermore, where appropriate an array can be referred to as "gene chip". The 
array itself can have different formats, e.g. libraries of soluble probes or libraries of 
probes tethered to resin beads, silica chips, or other solid supports. 

The process of array fabrication is well-known to the person skilled in the art. In the 
25 following, the process for preparing a nucleic acid array is described. Commonly, the 

process comprises preparing a glass (or other) slide (e.g. chemical treatment of the glass 
to enhance binding of the nucleic acid probes to the glass surface), obtaining DNA 
sequences representing genes of a genome of interest, and spotting sequences these 
sequences of interest onto glass slide. Sequences of interest can be obtained via creating 
30 a cDNA library from an mRNA source or by using publicly available databases, such as 
GeneBank, to annotate the sequence information of custom cDNA libraries or to identify 
cDNA clones from previously prepared libraries. Generally, it is recommendable to 
amplify obtained sequences by PCR in order to have sufficient amounts of DNA to print 
on the array. The liquid containing the amplified probes can be deposited on the array by 
35 using a set of microspotting pins. Ideally, the amount deposited should be uniform. The 
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process can further include UV-crosslinking in order to enhance immobilization of the 
probes on the array. 

In a preferred embodiment, the array is a high density oligonucleotide (oligo) array using 
5 a light-directed chemical synthesis process, employing the so-called photolithography 
technology. Unlike common cDNA arrays, oligo arrays (according to the Affymetrix 
technology) use a single-dye technology. Given the sequence information of the markers, 
the sequence can be synthesized directly onto the array, thus, bypassing the need for 
physical intermediates, such as PCR products, required for making cDNA arrays. For 

10 this purpose, the marker, or partial sequences thereof, can be represented by 14 to 20 

features, preferably by less than 14 features, more preferably less than 10 features, even 
more preferably by 6 features or less, with each feature being a short sequence of 
nucleotides (oligonucleotide), which is a perfect match (PM) to a segment of the 
respective gene. The PM oligonucleotide are paired with mismatch (MM) 

15 oligonucleotides which have a single mismatch at the central base of the nucleotide and 
are used as "controls". The chip exposure sites are defined by masks and are deprotected 
by the use of light, followed by a chemical coupling step resulting in the synthesis of one 
nucleotide. The masking, light deprotection, and coupling process can then be repeated 
to synthesize the next nucleotide, until the nucleotide chain is of the specified length. 

20 

Advantageously, the method of the present invention is carried out in a robotics system 
including robotic plating and a robotic liquid transfer system, e.g. using microfluidics, 
i.e. channelled structured. 

25 A particular preferred method according to the present invention is as follows: 

1 . Obtaining a sample, e.g. bone marrow aliquots, from a patient having AML 

2. Extracting RNA, preferably mRNA, from the sample 

3. Reverse transcribing the RNA into cDNA 

4. In vitro transcribing the cDNA into cRNA 
30 5 . Fragmenting the cRNA 

6. Hybridizing the fragmented cRNA on standard microarrays 

7. Determining hybridization 

In another embodiment, the present invention is directed to the use of at least one marker 
35 selected from the markers identifiable by their Affymetrix Identification Numbers (affy 
id) as defined in Tables 1, 2, and/or 3, for the manufacturing of a diagnostic for 
distinguishing MLL-PTD-positive AML from other AML subtypes. The use of the 
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present invention is particularly advantageous for distinguishing MLL-PTD-positive 
AML from other AML subtypes in an individual having AML. The use of said markers 
for diagnosis of MLL-PTD-positive AML, preferably based on microarray technology, 
offers the following advantages: (1) more rapid and more precise diagnosis, (2) easy to 
5 use in laboratories without specialized experience, (3) abolishes the requirement for 
analyzing viable cells for chromosome analysis (transport problem), and (4) very 
experienced hematologists for cytomorphology and cytochemistry, immunophenotyping 
as well as cytogeneticists and molecularbiologists are no longer required. 

10 Accordingly, the present invention refers to a diagnostic kit containing at least one 
marker selected from the markers identifiable by their Affymetrix Identification 
Numbers (affy id) as defined in Tables 1, and/or 3 for distinguishing MLL-PTD-positive 
AML from other AML subtypes, in combination with suitable auxiliaries. Suitable 
auxiliaries, as used herein, include buffers, enzymes, labelling compounds, and the like. 

15 In a preferred embodiment, the marker contained in the kit is a nucleic acid molecule 

which is capable of hybridizing to the mRNA corresponding to at least one marker of the 
present invention. Preferably, the at least one nucleic acid molecule is attached to a solid 
support, e.g. a polystyrene microtiter dish, nitrocellulose membrane, glass surface or to 
non-immobilized particles in solution. 

20 

In another preferred embodiment, the diagnostic kit contains at least one reference for a 
MLL-PTD-positive AML subtype. As used herein, the reference can be a sample or a 
data bank. 

25 In another embodiment, the present invention is directed to an apparatus for 

distinguishing MLL-PTD-positive AML from other AML subtypes in a sample, 
containing a reference data bank obtainable by comprising 

(a) compiling a gene expression profile of a patient sample by determining the 
expression level at least one marker selected from the markers identifiable by 

30 their Affymetrix Identification Numbers (affy id) as defined in Tables 1 , 

and/or 3, and 

(b) classifying the gene expression profile by means of a machine learning 
algorithm. 

35 According to the present invention, the "machine learning algorithm" is a computational- 
based prediction methodology, also known to the person skilled in the art as "classifier", 
employed for characterizing a gene expression profile. The signals corresponding to a 
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certain expression level which are obtained by the microarray hybridization are subjected 
to the algorithm in order to classify the expression profile. Supervised learning involves 
"training" a classifier to recognize the distinctions among classes and then "testing" the 
accuracy of the classifier on an independent test set. For new, unknown sample the 
5 classifier shall predict into which class the sample belongs. 

Preferably, the machine learning algorithm is selected from the group consisting of 
Weighted Voting, K-Nearest Neighbors, Decision Tree Induction, Support Vector 
Machines (SVM), and Feed-Forward Neural Networks. Most preferably, the machine 
10 learning algorithm is Support Vector Machine, such as polynomial kernel and Gaussian 
Radial Basis Function-kernel SVM models. 

The classification accuracy of a given gene list for a set of microarray experiments is 
preferably estimated using Support Vector Machines (SVM), because there is evidence 

15 that SVM-based prediction slightly outperforms other classification techniques like k- 
Nearest Neighbors (k-NN). The LIBSVM software package version 2.36 was used 
(SVM-type: C-SVC, linear kernel (http://wwwxsie.nUi.edu.tw/-cjlin/libsvm/)). The 
skilled artisan is furthermore referred to Brown et al., Proc.Natl.Acad.Sci., 2000; 97: 
262-267, Furey et al., Bioinformatics. 2000; 16: 906-914, and Vapnik V. Statistical 

20 Learning Theory. New York: Wiley, 1998. 

In detail, the classification accuracy of a given gene list for a set of microarray 
experiments can be estimated using Support Vector Machines (SVM) as supervised 
learning technique. Generally, SVMs are trained using differentially expressed genes 

25 which were identified on a subset of the data and then this trained model is employed to 
assign new samples to those trained groups from a second and different data set. 
Differentially expressed genes were identified applying ANOVA and t-test-statistics 
(Welch t-test). Based on identified distinct gene expression signatures respective training 
sets consisting of 2/3 of cases and test sets with 1/3 of cases to assess classification 

30 accuracies are designated. Assignment of cases to training and test set is randomized and 
balanced by diagnosis. Based on the training set a Support Vector Machine (SVM) 
model is built. 

According to the present invention, the apparent accuracy, i.e. the overall rate of correct 
35 predictions of the complete data set was estimated by lOfold cross validation. This 
means that the data set was divided into 10 approximately equally sized subsets, an 



Attorney Docket No. 22339-US 
Express Mailing Label EV 338302397 US 



-22- 

SVM-model was trained for 9 subsets and predictions were generated for the remaining 
subset. This training and prediction process was repeated 10 times to include predictions 
for each subset. Subsequently the data set was split into a training set, consisting of two 
thirds of the samples, and a test set with the remaining one third. Apparent accuracy for 
5 the training set was estimated by lOfold cross validation (analogous to apparent accuracy 
for complete set). A SVM-model of the training set was built to predict diagnosis in the 
independent test set, thereby estimating true accuracy of the prediction model. This 
prediction approach was applied both for overall classification (multi-class) and binary 
classification (diagnosis X => yes or no). For the latter, sensitivity and specificity were 
10 calculated: 

Sensitivity = (number of positive samples predicted)/(number of true positives) 
Specificity = (number of negative samples predicted)/(number of true negatives) 

In a preferred embodiment, the reference data bank is backed up on a computational data 
15 memory chip which can be inserted in as well as removed from the apparatus of the 
present invention, e.g. like an interchangeable module, in order to use another data 
memory chip containing a different reference data bank. 

The apparatus of the present invention containing a desired reference data bank can be 
20 used in a way such that an unknown sample is, first, subjected to gene expression 

profiling, e.g. by microarray analysis in a manner as described supra or in the art, and the 
expression level data obtained by the analysis are, second, fed into the apparatus and 
compared with the data of the reference data bank obtainable by the above method. For 
this purpose, the apparatus suitably contains a device for entering the expression level of 
25 the data, for example a control panel such as a keyboard. The results, whether and how 
the data of the unknown sample fit into the reference data bank can be made visible on a 
provided monitor or display screen and, if desired, printed out on an incorporated of 
connected printer. 

30 Alternatively, the apparatus of the present invention is equipped with particular 
appliances suitable for detecting and measuring the expression profile data and, 
subsequently, proceeding with the comparison with the reference data bank. In this 
embodiment, the apparatus of the present invention can contain a gripper arm and/or a 
tray which takes up the microarray containing the hybridized nucleic acids. 



35 
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In another embodiment, the present invention refers to a reference data bank for 
distinguishing MLL-PTD-positive AML from other AML subtypes in a sample 
obtainable by comprising 

(a) compiling a gene expression profile of a patient sample by determining the 

5 expression level of at least one marker selected from the markers identifiable 

by their Affymetrix Identification Numbers (affy id) as defined in Tables 1, 
and/or 3, and 

(b) classifying the gene expression profile by means of a machine learning 
algorithm. 

10 

Preferably, the reference data bank is backed up and/or contained in a computational 
memory data chip. 

The invention is further illustrated in the following table and examples, without limiting 
1 5 the scope of the invention: 

TABLE 1.1-3.15 

Table 1.1-3.15 show AML subtype analysis of MLL-PTD-positive AML versus other 
20 AML subtypes. The analysed markers are ordered according to their q-values, beginning 
with the lowest q-values. 

For convenience and a better understanding, Tables 1.1 to 3.15 are accompanied with 
explanatory tables (Table LI A to 3. 15 A) where the numbering and the Affymetrix Id are 
further defined by other parameters, e.g. gene bank accession number. 

25 

EXAMPLES 

Example 1 : General experimental design of the invention and results 

30 Partial tandem duplication within the MLL-gene (MLL-PTD) can be found in 10% of 

AML with normal karyotype. Like MLL-translocations (t(l Iq23)/MLL) the occurence of 
MLL-PTD is characterized by an unfavourable prognosis. The pathogenetic mechanisms 
of the MLL-PTD are poorly understood and downstream genes effected by this 
molecular aberration are not known. To get more insight into the pathogenesis of PTD+ 

35 AML we performed global gene expression profiling of 1 84 AML samples at diagnosis 
using the U133 set of expression microarrays (Affymetrix) with >30,000 human genes 
represented on both arrays. Microarray data was analyzed by pattern recognition 
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algorithms (Principal Component Analysis (PCA), hierarchical clustering), as well as 
Support Vector Machines (SVM) for estimation of classification accuracies. Therefore, 
all samples were divided into a training set consisting of 2/3 of cases to built a SVM 
model and a test set with remaining 1/3 of cases. Assignment of cases to training and test 
5 set was randomized and balanced by diagnosis. Differentially expressed genes were 
selected according to ANOVA and t-test-statistics in the training set. Classification 
accuracy was assessed in the test set. In detail, we analyzed 30 cases with t(l Iq23)/MLL, 
30 cases with normal karyotype AML and MLL-PTD (PTD+ AML) and 124 cases with 
normal karyotype without MLL-PTD (AML-NK). All data analysis algorithms 

10 demonstrate that PTD+ AML can clearly be distinguished from t(l Iq23)/MLL positive 
AML with 100% accuracy. Thus, despite an identical gene targeted by molecular 
mutation or chromosomal translocation, this finding illustrates that both kinds of 
aberrations lead to biologically distinct leukemia subclasses. Some of the most 
significantly differentially expressed genes that were highly expressed in t(l Iq23)/MLL 

1 5 in comparison to PTD+ AML were CACNA2DA, MBNL1 , and PBX3 . Reversely, genes 
with high expression in PTD+ and low in t(l Iq23)/MLL samples were HOXB5, 
HOXB2, MAN1A1, and ZNF207. At next, we addressed the question whether PTD+ 
AML can be discriminated from AML-NK by a specific gene expression signature. Both 
PCA and hierarchical cluster visualize that the MLL-PTD samples characterize a 

20 homogeneous subgroup within AML with normal karyotype, but do not separate from 

them. Some of the genes that were highly expressed in AML-NK and low in PTD+ were 
AAK1, RAB4A, HOXA2, BID. On the other hand genes that were low in AML-NK and 
high in PTD+ were, among others, MLL, YY1, and SRP46. In addition, we attempted to 
classify the analyzed samples by means of SVM. Here, the training set comprised 83 

25 AML-NK and 19 PTD+ AML cases, the test set 41 AML-NK and 9 PTD+ AML cases, 
respectively. The 50 test samples were assigned to the correct group with an accuracy of 
88%. In detail, 6/9 PTD+ AML (92.7% specificity, 66.7% sensitivity) and 38/41 AML- 
NK (66.7% specificity, 92.7% sensitivity) were accurately assigned. In conclusion, 
despite a significantly worse prognosis of the PTD+ AML cases within the large group 

30 of AML with normal karyotype it is not possible to designate a highly characteristic 

specific gene expression signature at diagnosis as has been demonstrated for AML with 
balanced chromosomal aberrations. This unexpected results may be in part due to the 
fact that pts with PTD do not belong to a specific morphologic subgroup. Thus the 
expression pattern associated with heterogenous FAB subtypes may overwrite that 

35 generated bei the PTD. In addition, different unknown accompanying mutation may 
generate a dominant expression pattern. 
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Example 2: General materials, methods and definitions of functional annotations 

The methods section contains both information on statistical analyses used for 
identification of differentially expressed genes and detailed annotation data of identified 
5 microarray probesets. 

Affymetrix Probeset Annotation 

All annotation data of GeneChip® arrays are extracted from the NetAffx™ Analysis 
Center (internet website: www.affymetrix.com). Files for U133 set arrays, including 
10 U133A and U133B microarrays are derived from the June 2003 release. The original 
publication refers to: Liu G, Loraine AE, Shigeta R, Cline M, Cheng J, Valmeekam V, 
Sun S, Kulp D, Siani-Rose MA. NetAffx: Affymetrix probesets and annotations. Nucleic 
Acids Res. 2003;31(l):82-6. 

15 The sequence data are omitted due to their large size, and because they do not change, 
whereas the annotation data are updated periodically, for example new information on 
chromomal location and functional annotation of the respective gene products. Sequence 
data are available for download in the NetAffx Download Center (www.affymetrix.com) 

20 Data fields: 

In the following section, the content of each field of the data files are described. 
Microarray probesets, for example found to be differentially expressed between different 
types of leukemia samples are further described by additional information. The fields are 
of the following types: 

25 

1 . GeneChip Array Information 

2. Probe Design Information 

3. Public Domain and Genomic References 

30 1 . GeneChip Array Information 
HG-U133 ProbeSet_ID: 

HG-U133 ProbeSetID describes the probe set identifier. Examples are: 200007_at, 
20001 l_s_at, 200012_x_at. 

35 

GeneChip: 
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The description of the GeneChip probe array name where the respective probeset is 
represented. Examples are: Affymetrix Human Genome U133A Array or Affymetrix 
Human Genome U133B Array. 

5 2. Probe Design Information 

Sequence Type: 

The Sequence Type indicates whether the sequence is an Exemplar, Consensus or 
Control sequence. An Exemplar is a single nucleotide sequence taken directly from a 
1 0 public database. This sequence could be an mRNA or EST. A Consensus sequence, is a 
nucleotide sequence assembled by Affymetrix, based on one or more sequence taken 
from a public database. 

Transcript ID: 

15 The cluster identification number with a sub-cluster identifier appended. 
Sequence Derived From: 

The accession number of the single sequence, or representative sequence on which the 
probe set is based. Refer to the "Sequence Source" field to determine the database used. 

20 

Sequence ID: 

For Exemplar sequences: Public accession number or GenBank identifier. For Consensus 
sequences: Affymetrix identification number or public accession number. 

25 Sequence Source: 

The database from which the sequence used to design this probe set was taken. Examples 
are: GenBank®, RefSeq, UniGene, TIGR (annotations from The Institute for Genomic 
Research). 

30 3. Public Domain and Genomic References 

Most of the data in this section come from LocusLink and UniGene databases, and are 
annotations of the reference sequence on which the probe set is modeled. 

35 Gene Symbol and Title: 

A gene symbol and a short title, when one is available. Such symbols are assigned by 
different organizations for different species. Affymetrix annotational data come from the 
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UniGene record. There is no indication which species-specific databank was used, but 
some of the possibilities include for example HUGO: The Human Genome Organization. 

MapLocation: 

5 The map location describes the chromosomal location when one is available. 
UnigeneAccession: 

UniGene accession number and cluster type. Cluster type can be "full length" or "est", or 
" — " if unknown. 

10 

LocusLink: 

This information represents the LocusLink accession number. 

Full Length Ref. Sequences: 
15 Indicates the references to multiple sequences in RefSeq. The field contains the ID and 
description for each entry, and there can be multiple entries per probeSet. 

Example 3: Sample preparation, processing and data analysis 

20 Method 1: 

Microarray analyses were performed utilizing the GeneChip® System (Affymetrix, Santa 
Clara, USA). Hybridization target preparations were performed according to 
recommended protocols (Affymetrix Technical Manual). In detail, at time of diagnosis, 
mononuclear cells were purified by Ficoll-Hypaque density centrifugation. They had 

25 been lysed immediately in RLT buffer (Qiagen, Hilden, Germany), frozen, and stored at - 
80°C from 1 week to 38 months. For gene expression profiling cell lysates of the 
leukemia samples were thawed, homogenized (QIAshredder, Qiagen), and total RNA 
was extracted (RNeasy Mini Kit, Qiagen). Subsequently, 5-10 fig total RNA isolated 
from 1 x 10 7 cells was used as starting material for cDNA synthesis with 

30 oligo[(dT)24T7promotor] 6 5 primer (cDNA Synthesis System, Roche Applied Science, 
Mannheim, Germany). cDNA products were purified by phenol/chlorophorm/IAA 
extraction (Ambion, Austin, USA) and acetate/ethanol-precipitated overnight. For 
detection of the hybridized target nucleic acid biotin-labeled ribonucleotides were 
incorporated during the following in vitro transcription reaction (Enzo Bio Array 

35 HighYield RNA Transcript Labeling Kit, Enzo Diagnostics). After quantification by 

spectrophotometric measurements and 260/280 absorbance values assessment for quality 
control of the purified cRNA (RNeasy Mini Kit, Qiagen), 15 /xg cRNA was fragmented 
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by alkaline treatment (200 mM Tris-acetate, pH 8.2/500 mM potassium acetate/150 mM 
magnesium acetate) and added to the hybridization cocktail sufficient for five 
hybridizations on standard GeneChip microarrays (300 /xl final volume). Washing and 
staining of the probe arrays was performed according to the recommended Fluidics 
5 Station protocol (EukGE-WS2v4). Affymetrix Microarray Suite software (version 5.0.1) 
extracted fluorescence signal intensities from each feature on the microarrays as detected 
by confocal laser scanning according to the manufacturer's recommendations. 

Expression analysis quality assessment parameters included visiual array inspection of 
10 the scanned image for the presence of image artifacts and correct grid alignment for the 
identification of distinct probe cells as well as both low 375' ratio of housekeeping 
controls (mean: 1.90 for GAPDH) and high percentage of detection calls (mean: 46.3% 
present called genes). The 3' to 5' ratio of GAPDH probesets can be used to assess RNA 
sample and assay quality. Signal values of the 3' probe sets for GAPDH are compared to 
15 the Signal values of the corresponding 5' probe set. The ratio of the 3' probe set to the 5 ' 
probe set is generally no more than 3.0. A high 3' to 5' ratio may indicate degraded RNA 
or inefficient synthesis of ds cDNA or biotinylated cRNA (GeneChip Expression 
Analysis Technical Manual, www.affymetrix.com). Detection calls are used to determine 
whether the transcript of a gene is detected (present) or undetected (absent) and were 
20 calculated using default parameters of the Microarray Analysis Suite MAS 5.0 software 
package. 

Method 2: 

Bone marrow (BM) aspirates are taken at the time of the initial diagnostic biopsy and 
25 remaining material is immediately lysed in RLT buffer (Qiagen), frozen and stored at -80 
C until preparation for gene expression analysis. For microarray analysis the GeneChip 
System (Affymetrix, Santa Clara, CA, USA) is used. The targets for GeneChip analysis 
are prepared according to the current Expression Analysis. Briefly, frozen lysates of the 
leukemia samples are thawed, homogenized (QIAshredder, Qiagen) and total RNA 
30 extracted (RNeasy Mini Kit, Qiagen).Normally 10 ug total RNA isolated from 1 x 107 
cells is used as starting material in the subsequent cDNA-Synthesis using Ohgo-dT-T7- 
Promotor Primer (cDNA synthesis Kit, Roche Molecular Biochemicals). The cDNA is 
purified by phenol-chlorophorm extraction and precipitated with 100% Ethanol over 
night. For detection of the hybridized target nucleic acid biotin-labeled ribonucleotides 
35 are incorporated during the in vitro transcription reaction (Enzo® Bio Array™ 

HighYield™ RNA Transcript Labeling Kit, ENZO). After quantification of the purified 
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cRNA (RNeasy Mini Kit, Qiagen), 15 ug are fragmented by alkaline treatment (200 mM 
Tris-acetate, pH 8.2, 500 mM potassium acetate, 150 mM magnesium acetate) and added 
to the hybridization cocktail sufficient for 5 hybridizations on standard GeneChip 
microarrays. Before expression profiling Test3 Probe Arrays (Affymetrix) are chosen for 
5 monitoring of the integrity of the cRNA. Only labeled cRNA-cocktails which showed a 
ratio of the messured intensity of the 3' to the 5* end of the GAPDH gene less than 3.0 are 
selected for subsequent hybridization on HG-U133 probe arrays (Affymetrix). Washing 
and staining the Probe arrays is performed as described (siehe Affymetrix-Original- 
Literatur (LOCKHART und LIPSHUTZ). The Affymetrix software (Microarray Suite, 
10 Version 4.0.1) extracted fluorescence intensities from each element on the arrays as 

detected by confocal laser scanning according to the manufacturers recommendations. 

While the foregoing invention has been described in some detail for purposes of clarity 
and understanding, it will be clear to one skilled in the art from a reading of this 

15 disclosure that various changes in form and detail can be made without departing from 
the true scope of the invention. For example, all the techniques and apparatus described 
above can be used in various combinations. All publications, patents, patent 
applications, and/or other documents cited in this application are incorporated by 
reference in their entirety for all purposes to the same extent as if each individual 

20 publication, patent, patent application, and/or other document were individually 
indicated to be incorporated by reference for all purposes. 
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-0.78 


-8.55 


7p15-p14 


45 


227929__at 




-8.81 


7.13E-14 


6.11E-11 


-0.77 


-8.54 




46 


20541 9_at 


EBI2 


2.88 


7.89E-10 


1 .81 E-07 


0.99 


8.54 


13q32.2 


47 


230894_s_at 




-6.89 


4.65E-14 


4.33E-1 1 


-0.75 


-8.50 




48 


223044_at 


SLC11A3 


-6.02 


5.22E-14 


4.59E-1 1 


-0.75 


-8.48 


2q32 


49 


218477_at 


PTD011 


-2.45 


1.30E-13 


9.87E-11 


-0.75 


-8.38 


6p12.1 


50 


212463_at 




-3.96 


1.08E-13 


8.58E-11 


-0.74 


-8.38 








































2.2 


P~TD vpr^u<i rp<?t 






























# 


affy id 


HUGO name 


fc 




q 


stn 


t 


Map Location 




AFFX- 

HSAC07/X00351 M at 
-HG-U133B 


ACTB 


-1.79 


2.83E-10 


5.20E-07 


-0.80 


-7.93 


7p15-p12 


2 


200885_at 


ARHC 


-2.26 


3.49E-12 


4.17E-08 


-0.72 


-7.92 


1p13.1 


3 


205131_x_at 


SCGF 


-4.34 


1.66E-12 


3.97E-08 


-0.69 


-7.85 


19q13.3 


4 


208623_s_at 


VI L2 


-2.26 


3.72E-11 


1 .33E-07 


-0.74 


-7.81 


6q25.2-q26 


5 


205600_x_at 


HOXB5 


2.65 


1 .37E-07 


3.63E-05 


1.24 


7.61 


17q21.3 


6 


210783_x_at 


SCGF 


-4.02 


8.51 E-12 


6.78E-08 


-0.67 


-7.54 


19q13.3 


7 


208858_s_at 


MBC2 


-2.31 


5.02E-11 


1 .33E-07 


-0.69 


-7.49 


12q13.13 


8 


220363_s_at 


ELM02 


-4.14 


1 .34E-1 1 


7.98E-08 


-0.66 


-7.45 


20q13 


9 


224659_at 


SEPN1 


-2.56 


7.93E-10 


1.01E-06 


-0.73 


-7.42 


1p36.13 


10 


218530_at 


FHOD1 


-1.64 


4.86E-11 


1 .33E-07 


-0.66 


-7.35 


16q22 


11 


209679_s_at 


LOC57228 


-2.47 


5.02E-10 


7.50E-07 


-0.70 


-7.34 


12q13.12 


12 


203331 _s_at 


INPP5D 


-2.87 


3.72E-11 


1 .33E-07 


-0.66 


-7.34 


2q36-q37 


13 


205601_s_at 


HOXB5 


3.02 


2.74E-07 


5.85E-05 


1.19 


7.29 


17q21.3 


14 


210213__s_at 


ITGB4BP 


-1.69 


2.12E-10 


4.61 E-07 


-0.67 


-7.26 


20q12 


15 


225065_x_at 


MGC40157 


-2.21 


4.24E-11 


1 .33 E-07 


-0.64 


-7.23 


17p11.2 


16 


227711 at 


FLJ32942 


-3.24 


8.04E-10 


1.01E-06 


-0.69 


-7.20 


12q13.13 


17 


203332 s at 


INPP5D 


-1.59 


1 .44E-09 


1 .43E-06 


-0.67 


-7.06 


2q36-q37 


18 


214789 x at 


SRP46 


1.67 


8 09E-09 


5.54E-06 


0.72 


7.05 


11q22 


19 


AFFX- 

HSAC07/X00351 M at 
- HG-U133A 


ACTB 


-1.72 


1.41E-08 


8.14E-06 


-0.74 


-7.04 


7p15-p12 


20 


201043_s_at 


ANP32A 


-2.54 


4.44E-10 


7.07E-07 


-0.63 


-6.95 


15q22.3-q23 


21 


201389_at 


ITGA5 


-1.80 


2.09E-10 


4.61 E-07 


-0.61 


-6.92 


12q11-q13 


22 


207106_s_at 


LTK 


-2.46 


2.45E-10 


4.89E-07 


-0.61 


-6.90 


15q15.1-q21.1 


23 


208072_s_at 


DGKD 


-1.99 


1.14E-09 


1 .30E-06 


-0.63 


-6.85 


2q37.1 


24 


211709__s_at 


SCGF 


-2.48 


1.74E-08 


9.22E-06 


-0.69 


-6.81 


19q13.3 


25 


213048_s_at 


SET 


-1.78 


4.29E-10 


7.07E-07 


-0.61 


-6.81 


9q34 
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26 


200982_s_at 


ANXA6 


-2.26 


2.79E-09 


2.47E-06 


-0.64 


-6.81 


5q32-q34 


27 


229143_at 


CNOT3 


1.81 


2.88E-07 


6.03E-05 


0.86 


6.80 


19q13.4 


28 


227564_at 


FLJ32731 


-2.59 


5.94E-10 


8.35E-07 


-0.60 


-6.73 


8p11.1 


29 


213159_at 


PCNX 


-2.06 


2.67E-08 


1.19E-05 


-0.69 


-6.73 


14q24.1 


30 


209406_at 


BAG2 


2.62 


2.93E-07 


6.03 E-05 


0.82 


6.68 


6p12.3-p11.2 


31 


217223_s_at 


BCR 


-2.04 


1.40E-09 


1.43E-06 


-0.60 


-6.68 


22q 11.23 


32 


221879_at 


MGC4809 


-2.49 


7.52E-09 


5.45E-06 


-0.64 


-6.67 


15q22.2 


33 


201005_at 


CD9 


-3.85 


3.17E-09 


2.70E-06 


-0.61 


-6.64 


12p13.3 


34 


226678_at 




-2.50 


1.06E-09 


1 .27E-06 


-0.59 


-6.64 




35 


214475_x_at 


CAPN3 


-7.68 


1 .24E-09 


1 .35E-06 


-0.60 


-6.62 


15q15.1-q21.1 


36 


226640__at 


LOC221955 


-1.94 


1.14E-08 


7.08E-06 


-0.62 


-6.52 


7p22.2 


37 


232424_at 


PRDM16 


9.69 


1 .70E-06 


2.01 E-04 


1.16 


6.52 


1p36.23-p33 


38 


210150_s_at 


LAMA5 


-2.22 


3.39E-09 


2.79E-06 


-0.59 


-6.51 


20q13.2-q13.3 


39 


221560_at 


MARK4 


-2.20 


1.73E-09 


1 .65E-06 


-0.57 


-6.50 


19q13.3 


40 


205366_s_at 


HOXB6 


16.00 


1.82E-06 


2.11 E-04 


1.15 


6.49 


17q21.3 


41 


244413_at 


DCAL1 


-3.48 


2.53E-09 


2.33E-06 


-0.58 


-6.46 


12p13.2 


42 


208698_s_at 


NONO 


-1.72 


4.57E-08 


1 .68E-05 


-0.65 


-6.46 


Xq13.1 


43 


204612_at 


PKIA 


2.61 


1.02E-06 


1 .39E-04 


0.87 


6.45 


8q21.11 


44 


231775_at 




-2.59 


1.82E-08 


9.47E-06 


-0.61 


-6.42 




45 


224773_at 


NAV1 


-2.62 


3.52 E-09 


2.80E-06 


-0.57 


-6.42 




46 


229908_s_at 


CAB56184 


1.97 


7.20E-07 


1.11 E-04 


0.81 


6.42 


16p13.3 


47 


218892_at 


PCDH16 


-2.31 


2.21 E-08 


1 .08E-05 


-0.61 


-6.36 


11p15.4 


48 


202315_s_at 


BCR 


-1.64 


1 .57 E-08 


8.74E-06 


-0.59 


-6.34 


22q 11.23 


49 


201288_at 


ARHGDIB 


-1.33 


2.13E-07 


4.89E-05 


-0.68 


-6.32 


12p12.3 


50 


64408_s_at 


MGC4809 


-2.13 


2.15E-08 


1 .08E-05 


-0.60 


-6.32 


15q22.2 






































2.3 


inv3 versus rest 






























# 


affy id 


HUGO name 


fc 


P 


q 


stn 


t 


Map Location 


1 


205382_s_at 


DF 


-6.46 


4.97E-25 


1.13E-20 


-1.18 


-13.29 


19p13.3 


2 


202759_s_at 


AKAP2 


-3.39 


7.20E-17 


3.27E-13 


-0.99 


-10.68 


9q31-q33 


3 


24262 1_at 


FLJ32468 


-1.53 


1.64E-14 


3.11 E-1 1 


-1.04 


-10.66 


7q22.1 


4 


228161_at 


RAB32 


-2.13 


8.90E-18 


9.02E-14 


-0.92 


-10.28 


6q24.2 


5 


223534_s_at 


RPS6KL1 


-1.99 


4.86E-13 


4.80E-10 


-1.00 


-10.06 


14q24.2 


6 


212953_x_at 


CALR 


-2.71 


1.19E-17 


9.02E-14 


-0.89 


-10.00 


19p13.3-p13.2 


7 


210115_at 


RPL39L 


-7.93 


2.28E-17 


1.29E-13 


-0.90 


-9.98 


3q27 


8 


212318_at 


TRN-SR 


-2.27 


1.03E-13 


1.30E-10 


-0.96 


-9.94 


7q32.2 


9 


223703_at 


CDA017 


-2.69 


2.02E-15 


5.29E-12 


-0.91 


-9.87 


10q23.1 


10 


200700_s_at 


KDELR2 


-2.42 


9.75E-15 


2.01 E-1 1 


-0.92 


-9.81 


7p22.2 


11 


214575_s_at 


AZU1 


-6.38 


3.02E-16 


9.80E-13 


-0.87 


-9.68 


19p13.3 


12 


20492 1_at 


GAS8 


-2.97 


1.54E-16 


5.83E-13 


-0.85 


-9.55 


16q24.3 


13 


203949_at 


MPO 


-3.93 


1.75E-12 


1.59E-09 


-0.93 


-9.49 


17q23.1 


14 


231300_at 


LOC90835 


-2.91 


4.34E-14 


5.79E-11 


-0.87 


-9.39 


16p11.2 


15 


231736_x_at 


MGST1 


-3.72 


8.09E-12 


5.10E-09 


-0.92 


-9.26 


12p12.3-p12.1 


16 


226789_at 




-2.38 


9.71 E-1 3 


9.18E-10 


-0.88 


-9.16 
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17 


20430 1_at 


KIAA0711 


-8.64 


2.10E-15 


5.29E-12 


-0.81 


-9.12 


8p23.2 


18 


205131_x_at 


SCGF 


-6.25 


3.59E-15 


8.16E-12 


-0.79 


-8.98 


19q13.3 


19 


202760_s_at 


AKAP2 


-4.21 


2.17E-13 


2.59E-10 


-0.83 


-8.92 


9q31-q33 


20 


224886_at 


STUB1 


-1.76 


7.73E-12 


5.01 E-09 


-0.86 


-8.86 


16p13.3 


21 


203948_s_at 


MPO 


-4.76 


2.30E-12 


1.86E-09 


-0.84 


-8.84 


17q23.1 


22 


230044_at 




-2.88 


1.54E-11 


8.30E-09 


-0.86 


-8.79 




23 


204647_at 


HOMER3 


-4.13 


2.16E-14 


3.76E-11 


-0.78 


-8.75 


19p13.11 


24 


22491 8_x_at 


MGST1 


-3.41 


3.23E-10 


9.26 E-08 


-0.91 


-8.73 


12p12.3-p12.1 


25 


210783_x_at 


SCGF 


-5.85 


2.56E-14 


4.15E-11 


-0.77 


-8.64 


19q13.3 


26 


230480_at 


HIWI2 


-3.13 


3.91 E-14 


5.55E-11 


-0.77 


-8.63 


11q21 


27 


204548_at 


STAR 


-8.65 


2.80E-14 


4.23E-11 


-0.77 


-8.62 


8p11.2 


28 


205248_at 


C21orf5 


-1.82 


1.05E-11 


5.93E-09 


-0.82 


-8.56 


21q22.2 


29 


240672_at 




-1.53 


3.51 E-13 


3.79E-10 


-0.78 


-8.53 




30 


232250_at 


KIAA1257 


-2.91 


1.98E-11 


9.58E-09 


-0.82 


-8.51 


3q21.3 


31 


211048_s_at 


ERP70 


-2.42 


2.32E-13 


2.63E-10 


-0.76 


-8.47 


7q35 


32 


201186_at 


LRPAP1 


-2.41 


3.24E-12 


2.37E-09 


-0.79 


-8.45 


4p16.3 


33 


24391 7_at 




-1.41 


2.18E-12 


1.83E-09 


-0.78 


-8.38 




34 


22484 1_x_at 




1.47 


2.25E-08 


2.47E-06 


0.99 


8.33 




35 


239656_at 




-2.19 


2.57E-12 


1 .95E-09 


-0.77 


-8.33 




36 


211709_s_at 


SCGF 


-3.42 


1.61E-09 


3.17E-07 


-0.88 


-8.32 


19q13.3 


37 


21431 5_x_at 


CALR 


-1.94 


1.15E-10 


3.93E-08 


-0.82 


-8.32 


19p13.3-p13.2 


38 


208795_s_at 


MCM7 


-2.13 


1.34E-10 


4.34E-08 


-0.81 


-8.24 


7q21.3-q22.1 


39 


200654_at 


P4HB 


-2.24 


1.12E-09 


2.56E-07 


-0.85 


-8.20 


1 7q25 


40 


226123_at 


LOC286180 


-3.50 


1.83E-12 


1 .59 E-09 


-0.75 


-8.19 


8q12.1 


41 


202185_at 


PLOD3 


-1.87 


1.21E-10 


3.99E-08 


-0.80 


-8.18 


7q22 


42 


203675_at 


NUCB2 


-2.33 


6.81 E-11 


2.66E-08 


-0.79 


-8.14 


11p15.1-p14 


43 


219588_s_at 


FLJ2031 1 


-2.28 


6.05E-11 


2.45E-08 


-0.78 


-8.13 


7q36.3 


44 


226694_at 


AKAP2 


-3.97 


1.65E-11 


8.73E-09 


-0.76 


-8.10 


9q31-q33 


45 


227929_at 




-8.16 


4.11E-13 


4.24E-10 


-0.72 


-8.10 




46 


206395_at 


DGKG 


-2.78 


3.30E-10 


9.36E-08 


-0.80 


-8.06 


3q27-q28 


47 


228500_at 


FLJ32891 


-1.54 


1 .23E-08 


1.54E-06 


-0.90 


-8.05 


19q13.12 


48 


22474 1_x_at 




1.45 


3.81 E-08 


3.68E-06 


0.95 


8.03 




49 


202290_at 


PDAP1 


-2.39 


1.70E-10 


5.29E-08 


-0.79 


-8.03 


7q22.1 


50 


206440_at 


LIN7A 


-5.69 


2.39E-12 


1.87E-09 


-0.72 


-8.01 


12q21 






































2.4 


t(15;17) versus rest 






























# 


re . :-j 

affy id 


HUGO name 


fc 


P 


q 


stn 


t 


Map Location 


1 


21 1990_at 


III a l~> n A 4 

H LA-DP A1 


10.44 


2.24E-43 


4.90E-39 


-1.87 


-21.20 


6p21.3 


2 


20442 5_at 


ARHGAP4 


16.94 


1.02E-33 


1.12E-29 


-1.54 


-17.17 


Xq28 


3 


205771_s_at 


AKAP7 


-9.70 


3.81 E-33 


2.78E-29 


-1.46 


-16.49 


6q23 


4 


214450_at 


CTSW 


8.51 


5.55E-13 


4.01 E-11 


2.59 


16.08 


11q13.1 


5 


209732_at 


CLECSF2 


30.16 


2.08E-30 


1.14E-26 


-1.49 


-16.03 


12p13-p12 


6 


221004_s_at 


ITM2C 


5.38 


3.20E-13 


2.39E-11 


2.23 


15.51 


2q37 
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7 


38487_at 


STAB1 


9.09 


1.49E-12 


9.64E-11 


2.57 


15.45 


3p21.31 


8 


212953_x_at 


CALR 


3.17 


2.18E-13 


1.71E-11 


2.05 


15.10 


19p13.3-p13.2 


9 


201137_s_at 


■ ■I a r\nr>/( 

HLA-DPB1 


11.06 


1 .59E-28 


4.96E-25 


-1 .32 


-14.76 


6p21.3 


10 


211474_s_at 


SERPINB6 


-4.44 


7.00E-29 


2.55E-25 


-1.31 


-14.75 


6p25 


11 


201923_at 


PRDX4 


-6.23 


6.52E-29 


2.55E-25 


-1.30 


-14.73 


Xp22.13 


12 


201719_s_at 


EPB41L2 


_ 

12.00 


1.13E-27 


3.10E-24 


-1.29 


-14.36 


6q23 


13 


200931 _s_at 


VCL 


-3.67 


3.72E-26 


5.82E-23 


-1.29 


-14.32 


10q22.1-q23 


14 


213587_s_at 


LOC 155066 


-5.25 


1 .35E-27 


3.29E-24 


-1.27 


-14.25 


7q36.1 


15 


208306_x_at 


HLA-DRB4 


-7.12 


2.28E-27 


4.99E-24 


-1.27 


-14.25 


6p21 .3 


16 


227353_at 


EVER2 


-3.66 


1 .48E-22 


1.01E-19 


-1.34 


-14.24 


17q25.3 


17 


20931 2_x_at 


HLA-DRB1 


-6.66 


1 .03E-26 


1 .89E-23 


-1.26 


-14.15 


6p21.3 


18 


209619_at 


CD74 


-4.60 


4.12E-20 


1.73E-17 


-1.37 


-14.09 


5q32 


19 


217478_s_at 


HLA-DMA 


-5.51 


3.00E-27 


5.97E-24 


-1.24 


-14.00 


6p21.3 


20 


236554_x_at 


EVER2 


-3.57 


8.71 E-24 


1 .00E-20 


-1.26 


-13.76 


17q25.3 


21 


217848_s_at 


PP 


-3.40 


2.67E-22 


1.73E-19 


-1.28 


-13.71 


10q11.1-q24 


22 


200654_at 


P4HB 


2.12 


3.43E-15 


4.12E-13 


1.51 


13.68 


17q25 


23 


204362_at 


SCAP2 


10.71 


2.36E-26 


3.98E-23 


-1.20 


-13.57 


7p21-p15 


24 


203948_s_at 


MPO 


2.73 


1.13E-17 


2.44E-15 


1.36 


13.52 


17q23.1 


25 


204661 _at 


CDW52 


19.63 


3.20E-25 


4.37E-22 


-1.20 


-13.34 


1p36 


26 


225639_at 


SCAP2 


-9.61 


1.19E-25 


1.73E-22 


-1.18 


-13.31 


7p21-p15 


27 


228113_at 


STAT3 


-3.82 


5.05E-23 


4.09E-20 


-1 .19 


-13.12 


17q21 


28 


204670_x_at 


HLA-DRB5 


-5.39 


3.35E-21 


1.75E-18 


-1.21 


-13.01 


6p21.3 


29 


r\ a a r\r\ a — x 

211991_s_at 


ill a r~\ r — \ a a 

HLA-DPA1 


16.57 


2.51 E-24 


3.23E-21 


-1 .17 


-13.00 


6p21.3 


30 


34210_at 


CDW52 


24.45 


1.10E-23 


1 .20E-20 


-1.15 


-12.74 


1p36 


31 


241742_at 


PRAM-1 


-7.25 


7.57E-24 


9.20E-21 


-1.13 


-12.65 


19p13.2 


32 


201034_at 


ADD3 


-4.03 


1.12E-20 


5.12E-18 


-1.16 


-12.59 


10q24.2-q24.3 


33 


227598_at 


LOC1 13763 


-3.70 


2.75E-23 


2.60E-20 


-1.12 


-12.52 


7q35 


34 


223280_x_at 


MS4A6A 


_ 

16.52 


8.79E-23 


6.20E-20 


-1.14 


-12.45 


11q12.1 


35 


226077_at 


FLJ31951 


-5.11 


2.85E-23 


2.60E-20 


-1.10 


-12.42 


5q33.3 


36 


203535_at 


S100A9 


-7.03 


7.53E-23 


5.49E-20 


-1.11 


-12.40 


1q21 


37 


204563_at 


SELL 


-5.94 


1 .82E-23 


1.90E-20 


-1.09 


-12.36 


1q23-q25 


38 


209288_s_at 


CDC42EP3 


-7.09 


2.69E-23 


2.60E-20 


-1.09 


-12.34 


2p21 


39 


23261 7_at 


CTSS 


-5.39 


4.63E-23 


3.89E-20 


-1.10 


-12.33 


1q21 


40 


217716_s_at 


SEC61A1 


2.10 


1.23E-11 


6.32E-10 


1.64 


12.27 


3q21.3 


41 


208982_at 


PECAM1 


-4.56 


3.28E-23 


2.87E-20 


-1.08 


-12.25 


17q23 


42 


221865_at 


DKFZp547P234 


-2.93 


1 .25E-20 


5.58E-18 


-1.12 


-12.21 


9q33.1 


43 


209448_at 


HTATIP2 


-6.78 


6.40E-23 


4.83E-20 


-1.08 


-12.20 


11p15.1 


44 


226885_at 




-2.87 


9.67E-22 


5.57E-19 


-1.10 


-12.18 




45 


224356_x_at 


MS4A6A 


16.66 


4.97E-22 


2.94E-19 


-1.12 


-12.15 


11q12.1 


46 


216899_s_at 


SCAP2 


-5.20 


6.39E-23 


4.83E-20 


-1.07 


-12.15 


7p21-p15 


47 


201753_s_at 


ADD3 


-4.84 


4.78E-22 


2.90E-19 


-1.08 


-12.09 


10q24.2-q24.3 


48 


204361_s_at 


SCAP2 


-7.62 


2.78E-22 


1.74E-19 


-1.06 


-11.98 


7p21-p15 
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49 


238022_at 




4.86 


1.21E-11 


6.21 E-10 


1.55 


11.98 




50 


203299_s_at 


AP1S2 


-3.85 


2.69E-22 


1.73E-19 


-1.06 


-11.95 


Xp22.31 






































2.5 


t(821) versus rest 






























# 


affy id 


HUGO name 


fc 


P 


q 


stn 


t 


Map Location 


1 


225615_at 


LOC126917 


-6.47 


2.83E-26 


8.92E-22 


-1.20 


-13.56 


1p36.13 


2 


215087_at 




-3.10 


8.13E-22 


1.28E-17 


-1.04 


-11.71 




3 


221581_s_at 


WBSCR5 


-5.40 


1.39E-21 


1.46E-17 


-1.03 


-11.59 


7q11.23 


4 


224764_at 


ARHGAP10 


-5.56 


1.92E-21 


1.51 E-1 7 


-1 .02 


-11.54 


10 


5 


201425_at 


ALDH2 


-7.55 


6.33E-21 


3.99E-17 


-1.01 


-11.35 


12q24.2 


6 


238077_at 


MGC27385 


-3.41 


4.41 E-20 


1.74E-16 


-1.02 


-11.34 


3p21.1 


7 


220974_x_at 


BA108L7.2 


-4.55 


4.08E-20 


1.74E-16 


-1.01 


-11.27 


10q24.31 


8 


204494_s_at 


DKFZP434H132 


-3.00 


2.40E-20 


1.26E-16 


-0.99 


-11.14 


15q22.33 


9 


226865__at 




-5.33 


9.25E-20 


2.92E-16 


-0.97 


-10.92 




10 


204495_s_at 


DKFZP434H132 


-2.98 


6.35E-20 


2.22E-16 


-0.96 


-10.91 


15q22.33 


11 


209500_x_at 


TNFSF13 


-3.01 


2.97E-18 


7.82E-15 


-0.95 


-10.56 


17p13.1 


12 


201944_at 


HEXB 


-2.37 


2.13E-18 


6.09E-15 


-0.91 


-10.31 


5q13 


13 


227279_at 


MGC15737 


-2.42 


1.63E-15 


2.06E-12 


-0.95 


-10.19 


Xq22.1 


14 


210314_x_at 


TNFSF13 


-3.48 


1.19E-16 


2.09E-13 


-0.93 


-10.19 


17p13.1 


15 


200788_s_at 


PEA15 


-2.20 


8.80E-15 


6.77E-12 


-0.96 


-10.12 


1q21 .1 


16 


208890_s_at 


PLXNB2 


-3.44 


9.94E-17 


2.07E-13 


-0.91 


-10.06 


22q13.33 


17 


208146_s_at 


CPVL 


1 1.77 


2.37E-17 


5.74E-14 


-0.89 


-9.96 


7p15-p14 


18 


227995_at 




-6.95 


6.87E-17 


1.55E-13 


-0.90 


-9.85 




19 


240572_s_at 




-3.63 


1.05E-16 


2.07E-13 


-0.88 


-9.82 




20 


213147_at 


HOXA10 


-7.74 


1.17E-16 


2.09E-13 


-0.85 


-9.62 


7p15-p14 


21 


214651_s_at 


HOXA9 


_ 

90.11 


5.65E-16 


8.10E-13 


-0.90 


-9.51 


7p15-p14 


22 


217226_s_at 


BA108L7.2 


-2.91 


3.90E-15 


3.84E-12 


-0.87 


-9.49 


10q24.31 


23 


206120_at 


CD33 


-4.19 


2.12E-16 


3.52E-13 


-0.84 


-9.48 


19q13.3 


24 


227276_at 


TEM7R 


-2.62 


2.32E-15 


2.65E-12 


-0.86 


-9.47 


10p12.1 


25 


203320_at 


LNK 


-2.28 


6.01 E-1 5 


4.99E-12 


-0.86 


-9.38 


12q24 


26 


225245_x_at 


H2AFJ 


-3.58 


2.36E-15 


2.65E-12 


-0.84 


-9.36 


12p12 


27 


213150_at 


HOXA10 


_ 

24.16 


1.08E-15 


1.49E-12 


-0.86 


-9.35 


7p15-p14 


28 


207839_s_at 


LOC51754 


-2.83 


1.26E-11 


3.71 E-09 


-0.96 


-9.34 


9p13.1 


29 


200838_at 


CTSB 


-2.68 


4.59E-15 


4.26E-12 


-0.85 


-9.33 


8p22 


30 


205639_at 


AOAH 


-3.88 


1.02E-14 


7.66E-12 


-0.85 


-9.33 


7p14-p12 


31 


224049_at 


KCNK17 


-2.66 


5.20E-16 


8.10E-13 


-0.82 


-9.32 


6p21.1 


32 


207075_jat 


CIAS1 


-3.61 


5.59E-16 


8.10E-13 


-0.82 


-9.30 


1q44 


33 


20301 7_s_at 


SSX2IP 


-2.91 


2.24E-14 


1.50E-11 


-0.85 


-9.21 




34 


201887_at 


IL13RA1 


-2.98 


5.03E-15 


4.41 E-1 2 


-0.83 


-9.19 


Xq24 


35 


220066_at 


CARD 15 


-5.86 


1.34E-15 


1.77E-12 


-0.81 


-9.17 


16p12-q21 


36 


20809 1_s_at 


DKFZP564K0822 


-4.41 


5.18E-15 


4.41 E-12 


-0.81 


-9.08 


7p14.1 


37 


224393_s_at 


CECR6 


-8.27 


4.33E-15 


4.14E-12 


-0.82 


-9.05 




38 


20541 9_at 


EBI2 


-4.20 


2.95E-15 


3.01 E-12 


-0.80 


-9.05 


13q32.2 
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39 


212895_s_at 


ABR 


-2.64 


3.93E-13 


1.88E-10 


-0.86 


-9.04 


17p13.3 


40 


209803_s_at 


TSSC3 


-5.16 


2.42E-15 


2.65E-12 


-0.80 


-9.04 


11p15.5 


41 


238455_at 




-3.51 


2.44E-15 


2.65E-12 


-0.80 


-9.03 




42 


201850_at 


CAPG 


-4.57 


2.96E-15 


3.01 E-12 


-0.80 


-9.03 


2cen-q24 


43 


201105_at 


LGALS1 


-4.17 


1.83E-14 


1.31E-11 


-0.82 


-9.01 


22q13.1 


44 


227853_at 




-2.57 


1.24E-11 


3.69E-09 


-0.91 


-9.01 




45 


201360_at 


CST3 


-3.55 


6.33E-14 


3.70E-11 


-0.83 


-8.99 


20p11.21 


46 


24293 1_at 




-2.79 


3.90E-14 


2.46E-11 


-0.82 


-8.96 




47 


214835_s_at 


SUCLG2 


-3.26 


1.29E-13 


6.91 E-11 


-0.83 


-8.94 


3p14.2 


48 


204057_at 


ICSBP1 


-3.56 


4.82E-15 


4.34E-12 


-0.79 


-8.91 


16q24.1 


49 


223132_s_at 


TRIM8 


-2.24 


4.93E-14 


3.05E-11 


-0.81 


-8.91 


10q24.3 


50 


223398_at 


MGC11115 


-2.37 


1.29E-13 


6.91 E-11 


-0.82 


-8.87 


9q22.2 






































2.6 


tMLL versus rest 






























# 


affy id 


1 J | I /— t /— \ _____ 

HUGO name 


rc 


P 


q 


stn 


t 


Map Location 


1 


20274o_at 


ITM2A 


11.43 


5.44E-24 


1.38E-19 


-1.15 


-12.84 


Xq13.3- 
Xq21.2 


2 


202747_s_at 


ITM2A 


11.40 


1.57E-22 


1.98E-18 


-1.09 


-12.15 


Xq13.3- 
Xq21.2 


3 


200953_s_at 


CCND2 


-3.77 


6.25E-22 


5.27E-18 


-1.04 


-11.75 


12p13 


4 


22583 1_at 


LOC 148894 


-3.55 


4.79E-21 


3.03E-17 


-1.01 


-1 1 .43 


1p36.11 


5 


225344__at 


ERAP140 


-3.84 


1.11E-20 


4.82E-17 


-1.02 


-1 1 .38 


6q22.33 


6 


22651 7_at 


BCAT1 


-8.89 


3.57E-20 


1.00E-16 


-1.01 


-1 1 .22 


12pter-q12 


7 


218966_at 


MY05C 


-2.56 


1.14E-20 


4.82E-17 


-0.99 


-11.21 


15q21 


8 


201830_s_at 


NET1 


-3.67 


1 .39E-20 


5.00E-17 


-0.99 


-11.19 


10p15 


9 


221235_s_at 




-2.20 


2.89E-20 


r\ A r- r— a —f 

9.15E-17 


-0.98 


a a r\ r- 

-1 1 .05 




10 


200665_s_at 


SPARC 


-8.80 


1.44E-19 


3.65E-16 


-0.96 


-10.84 


5q31 .3-q32 


11 


200951 _s_at 


CCND2 


-4.27 


2.90E-19 


6.67E-16 


-0.94 


-10.64 


12p13 


12 


213737_x_at 




2.20 


2.63E-13 


1.19E-10 


1.15 


10.54 




13 


225653_at 




-1.81 


1.42E-18 


2.76E-15 


-0.93 


-10.45 




14 


201829_at 


NET1 


-2.42 


1.18E-18 


2.49E-15 


-0.92 


-10.39 


10p15 


15 


214651_s_at 


HOXA9 


4.96 


6.25E-13 


2.51E-10 


1.14 


10.32 


7p15-p14 


16 


224049_at 


KCNK17 


-2.99 


5.30E-18 


8.93E-15 


-0.92 


-10.25 


6p21.1 


17 


214390_s_at 


BCAT1 


-7.69 


5.12E-18 


8.93E-15 


-0.91 


-10.22 


12pter-q12 


A O 

18 


200952_s_at 


CCND2 


-2.66 


— . r\t~\ i — ac\ 

6.09E-18 


9.63E-15 


-0.90 


-10.15 


12p13 


a rv 

iy 


_:Ub/ol_at 


I AO 1 ILL 


14.52 


ry rnr A ~7 

6.58E-17 


9.79E-14 


-0.96 


-10.07 


3q13.13 


20 


227297_at 




11.16 


1.76E-16 


2.47E-13 


-0.90 


-9.76 




21 


200829_x_at 


ZNF207 


-1.50 


1.67E-15 


1.76E-12 


-0.90 


-9.71 


17q11.2 


22 


220104_at 


ZAP 


-2.31 


3.89E-15 


3.64E-12 


-0.87 


-9.49 


7q34 


23 


241756_at 




-3.07 


— , o — k r — a — ■ 

8.30E-16 


9.54E-13 


-0.85 


-9.48 




24 


225285_at 




-7.25 


3.04E-16 


4.05E-13 


-0.83 


-9.41 




25 


242051_at 




-2.81 


9.79E-16 


1.08E-12 


-0.84 


-9.38 




26 


241133_at 


TRB 


-6.23 


4.52E-16 


5.72E-13 


-0.83 


-9.37 


7q34 


27 


206009 at 


ITGA9 


-2.73 


6.77E-15 


5.70E-12 


-0.85 


-9.31 


3p21.3 
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28 


212667_at 


SPARC 


-4.73 


6.67E-16 


8.03E-13 


-0.82 


-9.29 


5q31.3-q32 


29 


204082_at 


PBX3 


5.29 


3.64E-10 


5.17E-08 


1.33 


9.19 


9q33-q34 


30 


231259_s_at 


CCND2 


-2.27 


6.26E-15 


5.46E-12 


-0.82 


-9.10 


12p13 


31 


219686_at 


HSA250839 


-9.83 


9.59E-15 


7.38E-12 


-0.86 


-9.08 


4p16.2 


32 


223126_s_at 


C1orf21 


-4.19 


2.86E-15 


2.78E-12 


-0.81 


-9.07 


1q25 


33 


23651 3_at 




-2.87 


2.43E-15 


2.46E-12 


-0.80 


-9.05 




34 


226152_at 


TTC7L1 


-2.36 


1.61E-13 


7.74E-11 


-0.85 


-8.99 


14q32.12 


35 


225532_at 


LOC91768 


-3.22 


4.14E-15 


3.74E-12 


-0.80 


-8.97 


18q11.1 


36 


226580_at 


BRMS1 


-2.02 


3.03E-14 


2.02E-11 


-0.81 


-8.91 


14q13.1 


37 


226473_at 


LOC147136 


-3.00 


1.39E-14 


1 .00E-1 1 


-0.80 


-8.89 


17q25.3 


38 


221760_at 


MAN1A1 


-5.47 


1.36E-14 


1 .00E-1 1 


-0.80 


-8.83 


6q22 


39 


23581 8_at 




-6.84 


7.71 E-15 


6.29E-12 


-0.78 


-8.83 




40 


211137_s_at 


ATP2C1 


-1.90 


9.64E-15 


7.38E-12 


-0.78 


-8.78 


3q21-q24 


41 


221581_s_at 


WBSCR5 


2.92 


2.60E-10 


3.98E-08 


1.06 


8.77 


7q11.23 


42 


218825_at 


ZNEU1 


-4.74 


1.82E-14 


1.28E-11 


-0.78 


-8.73 


9q34.3 


43 


240084_at 




-1.75 


1.04E-12 


3.75E-10 


-0.84 


-8.73 




44 


235753_at 




4.76 


5.06E-10 


7.03E-08 


1.10 


8.69 




45 


208116_s_at 


MAN 1 A1 


-3.99 


2.27E-14 


1.55E-11 


-0.77 


-8.68 


6q22 


46 


201015_s_at 


JUP 


-5.04 


2.31 E-13 


1.06E-10 


-0.79 


-8.61 


17q21 


47 


200923 at 


LGALS3BP 


-5.57 


9.28E-14 


5.06E-11 


-0.81 


-8.61 


17q25 


48 


221831_at 


LOC 148894 


-2.53 


7.35E-13 


2.82E-10 


-0.80 


-8.57 


1p36.11 


49 


218899_s_at 


BAALC 


-6.51 


9.62E-14 


5.07E-11 


-0.79 


-8.57 


8q22.3 


50 


205624_at 


CPA3 


13.99 


1.36E-13 


6.83E-11 


-0.81 


-8.55 


3q21-q25 
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3. All-Pairs (AP) 
3.1 M4eo versus PTD 



affy id 


HUGO name 


fc 


o 


n 


stn t 


Map 














Location 


1 235753_at 




-8.40 


1.24E-10 


4.49E-07 


-2.15 


-11.39 


2 206847_s_at 


HOXA7 


-5.18 


2.73E-11 


2.32E-07 


-1.81 


-10 98 7d15-d14 


OCiAAQT v ot 

o i^yr _x_ai 


ivi Y ri I I 


i Q DC 
1(5.00 


O HOC A r\ 


r> o~7 r~ r\-r 
O.Z/t-UY 


2.UO 


10. oo 1op13.13- 














p13.12 


4 213908_at 




-7.48 


4.93E-10 


1.14E-06 


-1.78 


-10.15 


5 213147_at 


HOXA10 


-5.00 


9.04E-11 


4.25E-07 


-1.60 


-9.96 7p15-p14 


6 235359_at 




3.64 


1.33E-11 


1 .95E-07 


1.48 


9.72 


"7 *~* A A C A a i 

7 214651_S_at 


HOXA9 


-17.28 


3.10E-09 


3.62 E-06 


-1.80 


-9.54 7p15-p14 


8 201496_x_at 


MYH11 


5.22 


1.00E-10 


4.25E-07 


1.49 


9.47 16p13.13- 














p13.12 


y ^uuyoo s ai 




O OQ 
Z.ZO 


H CQC A, A 
\ .OOtl-1 1 


1 .yob-UY 


a a r\ 
1 .4U 


A Or HO^HO 

9.00 12p1o 








Q HOC no 


Q OCT 


A. GO 
-1 .OZ 


Q O A O Q 














p11.2 


11 200951_s_at 


CCND2 


3.03 


6.00E-11 


3.81 E-07 


1.33 


8.89 12p13 


12 213150_at 


HOXA10 


-7.80 


1.12E-08 


9.12E-06 


-1.55 


-8.67 7p15-p14 


13 217963_s_at 


NGFRAP1 


-13.60 


1 .45E-08 


1.02E-05 


-1.52 


-8.52 Xq22.1 


14 202746_at 


ITM2A 


3.65 


2.22E-10 


6.27E-07 


1.27 


8.48 Xq13.3- 














Xq21.2 


1 5 202747_s_at 


ITM2A 


3.99 


2.69E-10 


6.85E-07 


1.26 


8.40 Xq13.3- 














VnO H O 

Aqzl .2 


1fi 997*^1 at 




9 rh 


1 T^P HQ 


9 fiOP OR 


1 °.9 




1 7 226352_at 




-7.39 


2.41 E-08 


1 .32E-05 


-1.51 


-8.35 


18 205330_at 


MN1 


8.19 


1.15E-08 


9.12E-06 


1.43 


8.29 22q12.1 


19 226944_at 
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CTSW 


-8.03 
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7.14E-10 


-3.05 
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STAB1 


-8.03 


2.37E-12 
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-3.01 
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ITM2C 


-5.22 


1.41E-13 
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CDW52 


33.75 
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2.99E-12 


2.25E-09 
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5.65 


3.24E-16 


4.28E-12 


2.06 
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7.22 


7.48E-14 
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-9.54 
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-2.41 
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HOXB2 
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7.66E-12 
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14 201425_at 


ALDH2 


7.88 


2.04E-10 


2.42E-07 


1.71 


9.98 12q24.2 
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